Behavioral Profiles: a fine-grained and quantitative approach in corpus-based lexical semantics
Abstract
Lexical semantics is probably the domain of linguistics that has been studied most with corpora. The main assumption underlying nearly all corpus-based work in lexical (and constructional) semantics is that the distributional characteristics of a linguistic expression reveal many if not most of its semantic and functional properties. Perhaps the most widely cited statement to this effect is Firth's (1957:11) famous dictum that "[y]ou shall know a word by the company it keeps." However, other quotes are arguably even more explicit and instructive, such as Bolinger's (1968:127) statement that "a difference in syntactic form always spells a difference in meaning" or Cruse's (1986:1) statement that "the semantic properties of a lexical item are fully reflected in appropriate aspects of the relations it contracts with actual and potential contexts." Most explicit in this regard is Harris (1970:785f.):
Similar resources
Behavioral Profiles: A fine-grained and quantitative approach in corpus-based lexical semantics
This paper introduces a fairly recent corpus-based approach to lexical semantics, the Behavioral Profile (BP) approach. After a short review of traditional corpus-based work on lexical semantics and its shortcomings, I explain the logic and methodology of the BP approach and exemplify its application to different lexical relations (polysemy, synonymy, antonymy) in English and Russian with an eye...
Inferring Semantics from Collocation Clusters to Represent Verbs and Nouns
Current lexical semantic theories provide representations at a coarse-grained level. In this paper, I will provide motivations for a fine-grained representation for verbs and nouns. An initial case study is done to serve as evidence that a more detailed representation is needed for tasks that require high accuracy rates, such as machine translation. An automatic approach to gather fine-grained...
That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets
We propose a novel data augmentation approach to enhance computational behavioral analysis using social media text. In particular, we collect a Twitter corpus of the descriptions of annoying behaviors using the #petpeeve hashtags. In the qualitative analysis, we study the language use in these tweets, with a special focus on the fine-grained categories and the geographic variation of the langua...
Extending Fine-Grained Semantic Relation Classification to Presupposition Relations between Verbs
In contrast to typical semantic relations between verbs, such as antonymy, synonymy or hyponymy, presupposition is a lexical relation that is not very well covered in existing lexical resources. It is also understudied in the field of corpus-based methods of learning semantic relations. But presupposition is very important for the quality of automatic semantic and discourse analysis tasks. In t...
A fine-grained and quantitative approach in corpus-based lexical semantics
This electronic file may not be altered in any way. The author(s) of this article is/are permitted to use this PDF file to generate printed copies to be used by way of offprints, for their personal use only. Permission is granted by the publishers to post this file on a closed server which is accessible to members (students and staff) only of the author’s/s’ institute, it is not permitted to po...